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ABSTRACT 



This paper presents the solution to the minimum transition time 
problem and the minimum fuel problem for a second order system with 
one zero. The control difficulties usually encountered due to dis- 
continuous action of the error states are eliminated by a transformation 
to a system with continuous variables. Optimum control of the trans- 
formed system is then accomplished using the methods of Pontryagin /l/. 
The control action is then related back to the original plant. 

Although the investigation is concerned entirely with a second order 
oscillatory system, the method is sufficiently general to be extended 
to the higher order system with zeros. 

The writer would like to thank Associate Professor Harold A. Titus, 
U. S. Naval Postgraduate School, who suggested the problem and whose 
continued cooperation, encouragement and assistance made the results 
possible . 
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1. Introduction. 



When controlling the performance of a system, it is often desir- 
able to choose the control that will minimize errors in the system and 
do it in the shortest possible time. A practical matter that must be 
considered in the optimization in relation to rapid action is the fact 
that control is of a bounded nature. In a great many important cases, 
the constraint on the magnitude of the control effort precludes the use 
of classical variational techniques to design the controller. 

In 1956 Pontryagin hypothesized his "maximum priciple" which 
has since been proven a necessary condition for the optimization of 
linear systems in relation to rapid action /l/. In solving the mini- 
mum time problem for linear systems with bounded control, the 
principle leads to a "bang-bang" form of control law. This implies 
that the control effort is always being applied at its maximum value. 
There remains, however, the task of finding the optimum time to 
switch the control. Pontryagin' s method leads to a rule for switching 
the controller which is a function of the initial conditions in the 
system adjoint to the one being controlled. Generally these initial 
conditions are difficult to find. 

It is usually helpful to consider the control problem using state 

th 

space techniques. The coordinates of the space for an n order 
system here are a displacement error and its n-1 time derivatives. 
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The space may be divided into two regions each of which is character- 



ized by the control optimal for the trajectories in that region. Optimum 
switching between the two conditions of the bang-bang control occur 
on the hypersurface dividing the space. The switching criteria can 
then be stated as a function of the state space variables. 

Of considerable value in finding the switching surface is the sys- 
tem adjoint to the system. The adjoint can be thought of as the system 
running in reverse time. By plotting trajectories from the origin of the 
error phase space "backwards" in time, with the control satisfying the 
respective adjoint variables, a surface is generated which may be re- 
lated to the optimal switching surface in the system state space. 

A problem of interest occurs when the system is of such a nature 
that when control is applied, a discontinuity appears in one or more 
of the system states. This may happen when the control is of a bang- 
bang form and the forward transmission path of the system contains 
zeros. It could also show up if the control is of such a form that it 
approximates an impulse to the system. When there are discontinui- 
ties in the state space due to switching it is generally no longer 
possible to write the switching criteria as a function of the state 
space variables. 

One alternative might be to switch the control as a function of 
time. This may be done effectively when the number of switchings 
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to reach the origin of the error state space is no more than n-1 in an 
th 

n order system. Such a restriction limits one mainly to considering 
only those systems with real, distinct eigenvalues. Large disturbances 
in lightly damped (oscillatory) systems may require more than n-1 
switchings to zero the error states. The most important consideration 
when controlling as a function of time is the means of implementing 
the switching logic. To accomplish time dependent control, it is vir- 
tually mandatory that a digital computer be inserted in the control loop. 

Another approach to the problem is to find a system that reacts 
identically to the system with zeros except at the points of disconti- 
nuity. Control of this parallel system can be stated in terms of the 
state space variables. This logic can then be used to switch the 
original plant. 

This paper will be an investigation into the latter method. The 
problem is as follows: 

Given a second order oscillatory system with one zero, 
find the optimum control for zeroing the errors in the 
system in minimum time and for zeroing the errors with 
minimum fuel. 

The method of Pontryagin is used to solve the problem. The brief 
description of the method presented here is based on the work of 
Rozonoer / 1/. 
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2 . Pontryagin's maximum principle . 



Given the system state variables described by n first order 
differential equations 

x. = f. (x,u,t) i=l,......,n (1) 

where x is a column vector in phase space and u is a column con- 
trol vector consisting of r control elements. 

The control u(t) must belong to a closed subset U of admissi- 
ble controls and must be piecewise continuous. The trajectory x(t) 
in the phase space is uniquely determined by (1) when control ujt) 
and the initial conditions 



x(0) = x° 




x 



o 

n 



( 2 ) 



are given. 

The control u(t) of a system may be considered optimum under a 
variety of criteria. A large class of optimization problems may be 
solved by presenting the criteria in such a way that the solution is 
attained by minimizing a linear function of the final value of the state 
space variables. A control must be selected from U that will trans- 
fer the system (1) from x° to some fixed closed set G of the phase 
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space such that 



n+1 

S = L c.x.(T) (3) 

1 1 1 

is a minimum. The constants c, and the x , coordinate are chosen 

i n+1 

such that minimizing (3) optimizes the system. 

In a great many cases optimization of only one of the coordinates 
of the system is desired. For example, in order to optimize the mag- 
nitude of ^ 

f F(x(t), u(t)) dt (4) 

J o 

for T and x(T) either fixed or free in a system (1) for u(t)eU, a 
new variable is introduced." 

x n+ i = F (x(t) < iiW) dt (5) 



and another differential equation 

* n+1 = F(x(t), u(t)) 

is added to (1). The problem of optimizing the integral leads to 
optimizing x^ + ^(T) at t = T. 

Minimizing x ^ (T) in the system (1) with x ^(t) adjoined is 
accomplished by putting the problem in functional form (3) and apply- 
ing the maximum principle to gain the solution. That is 

n+1 

S = L c x (T) = x (T) (G) 

l li n+1 
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is the functional to be minimized. Here it may be seen that 

. . . , c =0 and c . = 1 . 
n n+1 

A new dependent variable £(t) is now formed such that 



n+1 Bf (x, .u, t) 



The function 



p.(t) = -E 

1 I s 3x. 



n+1 

H = L p f (x, u, t) 



i = 1 ,n+l (7) 



1 



s s 



( 8 ) 



is introduced from which equations (1) and (7) may now be written 



3H 

x. = t 

i Bp. 

l 



3H 

P i 3x. 

l 



i = 1 n+1 (9) 

The control _u*(t) is said to satisfy the maximum condition if 

H(x*(t) ,£.*(t) ,u*(t)) reaches an absolute maximum at each time t 

(0 ^ t ^ T) where x* (t) and £*(t) are the values of the variables at 

time t with u*(t) e U controlling. For linear systems of the type 

discussed in this paper, the necessary and sufficient condition for 
n+1 

minimizing S = I c.x^(T) optimally with admissible control is that 
the control satisfy the maximum condition. 

To use the maximum principle, H is formed and maximized with 
respect to u(t) . This produces a 

u*(t) = 0 (x, £) (10) 

which may be used with equations (9) and the boundary conditions 
to find u*(x). If the end point of x(t) is not fixed, it becomes 
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necessary to obtain boundary conditions on £(t) in order to arrive at 

a solution. The conditions £(T) may be found using a function 

F(x) ^ 0 which describes G and (T) cG, the end point of an opti 

mum trajectory. The form of jd(T) will be stated without detailed 

explanation; however, it may be noticed that at time t = T, jd(T) is 

n+1 ^ 

orthogonal to a hyperplane E a,(x. - x, ) = 0 through the endpoint of 

1 1 1 1 

the trajectory and directed toward that portion of G where 
n+1 n+1 i 

E c.x, ^ E c.x.(T). The coefficients a. may be expressed as a 
xiii i 

^ 1 1 

linear combination of the c. and b^(x (T)), the latter being coeffi- 
cients of a hyperplane through x*(T) bracketing Go 
Thus 

P t (T) = -Ac. -jib.^CT)) (11) 

where X and /i are non-negative numbers one of which may be set 
equal to unity as it is only the ratio that is important. 

Generally, three situations arise as to final boundary conditions 

(i) If x^(T) are specified for i = 1,2, ,m then these 

become the boundary conditions for (9) . 

(ii) If x| (T) are internal points of G for i (1 ^ i ^ n+1) 
then b..(x^(T)) = 0 and p,(T) = -c.. 

(iii) If x^(T) are boundary points of G for some i (1 ^i^n+1) 
then F(x(T)) = 0 and the p^(T) are as in (11). 

When F is differentiable, the bracketing hyperplane through 
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(12) 



x* has coefficients 



b ifc 1 ®)-!ir 



x. = x}(T) 
1 1 



If finding the optimum control for minimum transit time another condi- 
tion must be fulfilled since T is not fixed beforehand. This condition 
is that H(T) = 0. 



3 . Development of system equations . 

The equation of a second order system with zeros may be written 
c + 2£toc + cu s c = a^u + a 0 u (13) 

where c is the output variable of the system and u is the output of 
a controller. 

This paper is concerned with control of similar systems that are 
purely oscillatory in nature i.e. £ = 0. To facilitate ease of com- 
putation in the analysis, equation (13) is scaled to 

c'+ c = a ^ u + u (14) 

which when written in terms of the Laplace transform of the output 
variable becomes 

(a s + l)U(s) 

C(s) = — (15) 

s + 1 

This system is represented in block diagram form in Fig. 1. 
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r\_ e 


CONTROL 
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ais + 1_ 


c 


“ s 






s 2 + 1 






i 















Fig. 1 - Block diagram of control system 
The response of the system to a step input is investigated more 
readily by means of the error variable 

e = r - c (16) 

If the input r is fed forward as in Fig. 2, the Laplace transform of 
the error, given 

c(0) = c° 

c(0) = c° 

R(s) = r 0 /s (17) 

becomes 

E ( s ) = (r 0 ~ c°)s - (c° + air n ) - (ais + l)U(s) 

s s + 1 

Now the problem of zeroing the error states reduces to that of zeroing 
the error initial conditions in the system. 
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Fig. 2 - Controlled system with input fed forward 
Finally with the introduction of state space variables 



ei = -e 
ez — ei 



(19) 



the system equations can be written in vector matrix notation /3/ 

( 20 ) 





0 1 




0 


e = 




e + 




— 


-1 0 




aiu + u 



4 . The minimum time problem. 

The problem is stated as follows: 

Given the system (20) and a control force of bounded 
magnitude |u| ^ 1, find the optimum control u*(t) to 
transfer the state variables from some initial point in 
the phase space to the origin of the phase space in 
minimum time T. 
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That is , given 



e.(0) = e° 

e(T) = 0 

M * 1 

and the system (20), find uf(t) such that 

r T 

S = adt 
J o 

is a minimum where a is a positive constant. 
Introduce 



e n+l = e * 



= S = i > 



dt 



The system equations then become 





"o i 6" 




0 


e_ = 


-10 0 


e + 


ai u + u 




0 0 0 

i 




a 



( 21 ) 



( 22 ) 



(23) 



(24) 



Because of (21) , the functional 

3 

S = £ c,e, (T) = c 3 e 3 (T) (25) 

1 1 1 

and since we wish to minimize this, c 3 = 1 is chosen. e 3 (T) is not 

limited, hence the boundary condition becomes 

P 3 (T) = -c 3 = -l (26) 

By (8), the hamiltonian becomes 

H = Pie 2 - p 2 ei + p 2 (axu+ u) + p 3 a (27) 

-8 H 

Since p 3 = -r =0 it is evident that p 3 is a constant and there- 

o e 3 



fore p 3 = p 3 (T) = -1 and now H is 
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H = Pie 2 - p 2 ei + p 2 (aiu + u) - a 



(28) 



which is maximized in u if 

aiii + u = N[sgn p 2 ] (29) 

where N = max |aiu + u| for each fixed t(0 ^ t ^ T) . The control 
u*(t) which satisfies these conditions is a "bang-bang" type control 
where u = ±1 at all times and u at the moment of switching is 
unbounded. 

Since e 3 has served its purpose in the optimization process, 
we may now return to the second order system and solve for the 
"impulse" variables . By (9), 

£ (3 0) 

and the solution for u*(t) becomes 

u*(t) = 1 • sgn[cos(t + ©) ] (31) 

where © is a phase angle dependent on x°. 

Several properties of the optimum controller are now known. 
First, the control is a bang-bang type which applies maximum effort 
at all times in one of the two "directions". It is switched periodi- 
cally from one state to the other every half cycle until the origin is 
reached. Notice that each time the control is switched, a disconti- 
nuity appears in the e 2 variable. This occurs because u contains 
an impulse . 



£ = 
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( 32 ) 



n s r s + - 

A e 2 = (-e x + a x u + u) dt = a x iidt = a x [u(t ) - u(t ) J 

S S 

s s 

where A e 2 is the discontinuity in e 2 at the time of switching t . 

s 

One would now like to find a switching curve L(e) which 
divides the phase plane e x vs e 2 in such a manner that control 
u* = +1 is optimum in the space to one side of the curve and u* = -1 
elsewhere. Control would be switched when the trajectory e* (t) 
crosses the curve. The discontinuity Ae 2 precludes this possi- 
bility. For example, examine the trajectory e_*(t) for some initial 
conditions that dictate u* = -1 for optimum control. At the point 
where this trajectory crosses L(e) the optimum becomes u* = +1. 

The control switches and A e 2 = +2a x occurs which places the states 
back in the space where u* = -1 was optimum. Here the control 
switches again, Ae 2 = -2a x occurs and chatter motion begins. The 
fact that e 2 is multiple valued at the instant of switching makes a 
simple realization of L(e) impossible. 

For periods between switchings where u = 0, the system is well 

t h 

behaved with the solution for the k interval 

e x (t) = K cos (t + 0ic) - 6 



(33) 



e 2 (t) = K cos (t + 0ic + 7 t/2 ) 

where 6=1* sgn p 2 and K,0ic depend on conditions of states at 
th 

the start of the k interval. 
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4 . 1 The transformed variable. 



The search for a variable of the system on which to control leads 

to the possibility of "subtracting out" the discontinuity present in 

e 2 at times of switching. 

The Laplace transforms of the system variables are 

t"* _ e?s + eg + (axs + 1) U(s) 

E ‘ (s) 



t, ,_ x egs - e? + s(axS + 1) U(s) 
Eats) = a ; 



(34) 



where 



TT , ^ 1 2 -tjs 2 -t 2 s 

U(s) = 5 ( e + — e 

s s s 



) (35) 



which for any instant of time t (0 ^ t < tx) 
U(s) = 



(3 6) 



Equations (34) then become 



Ex(s) = 



eg s 2 + (eg + ai6) s + 6 



s (s 2 + 1) 



t". / _ \ (eg + ax6) s + (-eg + 6) 

E 2 (s) = 

s 2 + 1 

By means of the initial value theorem, it is seen that 

lim ei (t) = lim sEx (s) = eg 
t-+ o s- 00 

lim e 2 (t) = lim sE 2 (s) = eg + ax5 
t-0 s-* 00 



(37) 



(38) 



At time t = 0, e 2 jumps to eg + a x 6 . To remove this discontinuity 
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consider the transformed variables 



Yi(s) = E x (s) 

Y 2 (s) = E a (s) - ^ 
s 



( 39 ) 



By virtue of (39) and (20) 



sYi(s) = sEx(s) = E a (s) = Y a (s) + 

sY 2 (s) = sE 2 (s) - a x 6 = -Yx(s) + — 

s 



(40) 



or 



0 1 



ai 

Y + 1 6 



(41) 



where 6 is a unit step function with sign to be determined. The 
system (41) is identical to that of (20) except for the action at 
time of switching. It should be noted, however, that care must be 
taken in assigning final values to the system described by (41) if 
the two plants are to be controlled in parallel. The final value 
theorem and (39) gives 



From this it is observed that zeroing the final states in (20) is 
analagous to zeroing yi (T) and attaining a final value 




(42) 



y 2 (T) = -ax6 



(43) 



in the system (41) . 
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4 . 2 Boundary conditions and final control. 



From (38) and (39) it is clear that the initial conditions on 
the e and y variables are identical. From (39) it is also seen 
that 



(44) 



Yi(T) = ej.(T) 

Ys(T) = e s (T) - 3x6(1) 

At this point in the pursuit of the optimum control, it becomes neces- 
sary to investigate the system action possible at time t = T under 
admissible control. Ae 2 of (32) provides a means of changing the 
value of e 2 instantaneously by an amount dictated by the constraints 
on u(t). With this in mind, it is noted that appropriate use of Ae 2 
within the bounds of allowable control may zero the e 2 variable in 
zero time given that e 2 (T) is within range. The conditions (21) 
and (44) with (32) indicate that for 

|y 2 (T) | ^ a x (45) 

the system (20) may be zeroed instantly'*'. The boundary conditions 
on (41) then become 

i = 1 ,2 

Yi(T) = 0 (46) 

|ys (T) 1 £ ax 



y.(0) = y° = e? 
i i i 



The conditions are stated in terms of the y variable for con- 
venience in order that notational problems arising from multiple 
value of e 2 (0) be avoided. 
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The final controller u 2 (T) that must zero the errors for t > T 



has two conditions imposed upon it i.e. , 



aiii 2 + u 2 — 0 
u a (T) - 6 (T) = ■ 
The solution to (47) is 

- ya(T) 



u 2 (t) = 



a i 



-ea(T) 

a i 



exp ( 



(47) 



-t+T 

a i 



t * T 



(48) 



It is assumed then that u 2 (t) is available at time t = T so that the 
boundary conditions on the system are as stated in (4 6 ). 



4 . 3 Switching functions . 

The method of finding a function L (y) with which to describe 
the switching criteria for the optimum trajectory proceeds as follows. 
As in the discontinuous case, it is desired that 

r T 

S = I adt (49) 

J o 

be minimized, therefore, another variable y 3 = S = c 3 y 3 (T) is 
adjoined to the system and once again Ci = c 2 =0. The hamiltonian 
becomes 

H = Piy 2 + Pia x 6 - P 2 yi + P 26 - cc (50. 



This is maximized in 6 when 

6=1* sgn (aj.Pi + p 2 ) (51) 

With this control, trajectories are circular about ( 6 , -ai 6 ) with 
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radius determined by y°. (See Fig. 3) 




6 = -1 

- 6 = +1 

■> direction of 

positive time 



Fig. 3 yi vs y 2 phase plane with trajectories for 6 = ± 1 
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Previous arguments have determined that the conditions on the 



system are 

Y i (0) = y° i = 1,2,3 

Yi(T) = 0 

|ya(T) | * a x (52) 



H(T) = 0 

Ps(T) = -1 

The function F = — (y| - a x ) ^ 0 may be used to describe G. 

From this 

= Yz (53) 

Yz = Yz 



b 2 (ys(T)) = -|~- 

°Yz 



and 



Ps(T) = -Xc 2 - pb 2 (y 2 (T) ) = -piyi (54) 

where n ^ 0 with modulus such that F(T) = 0. In the phase plane 
of Px vs p 2 it is sufficient to note that for trajectories terminating 
at y^CT) = -a X/ p 2 (T) ^ 0 and for trajectories ending at y^fT) = +a Xs 
p 2 (T) 0. This information in addition to the control (51) com- 

pletely define L(y) for trajectories ending on the extremes of the 
line segment |y 2 (T) | ^ a x . 

Fig. 4 depicts representative action for optimum trajectories 
terminating at y 2 (T) = -a x , y x (T) = 0. Trajectories ending at 
y 2 (T)=+a x , y x (T) = 0 are mirror images. The optimum switching 
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curves are generated by picking arbitrary values of £(T) from the 
admissible set for the corresponding boundary values of y_(t) and 
working backwards in time plotting the switching points determined 
from £(-t) on the yi vs Yz phase plane. 
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Fig. 4 - Concurrent action of £(t) and ^(t) for y_ (T) 



For trajectories ending in the interior of the line segment where 
\y 2 (T) | < ai , b 2 (y 2 (T)) = 0 and, therefore, p 2 (T) = 0. This com- 
pletes the information necessary to describe L(;y). Fig. 5 shows a 
representative trajectory arrived at by translating switching criteria 
from the £ - plane to the y_ ~ Plane. Fig. 6 portrays the curve with 
all dimensions . 





Y2 




switching curve 



Fig. 5 Pi vs p 2 and y x vs y 2 phase planes with complete switching 
curves 
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construction lines 
switching curve 



Fig. 6 - Optimum switching curve ( minimum time ) 
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Fig. 7 - Optimum trajectories of e.(t) and Y.(t) (minimum time ) 
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Fig. 8 - Optimum trajectories of e,(t) and x.(t) (minimum time) 
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Fig. 9 - Optimum trajectories of e.(t) and ^(t) ( minimum time ) 
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5 . The minimum fuel problem. 



The minimum fuel problem is solved by minimizing the. integral 

p T 

J = | ( | u | + |a x u |) dt (55) 

J o 

in the system (20) where T is not specified. It appears simpler, 
however, to once again make use of the transformed variable y.(t)» 
By minimizing 

r T 

1=1 |u|dt (56) 

J o 

in the transformed system (41), the desired result can be obtained 
provided 

i) the switchings in the time interval 0 ^ t ^ T are 
kept to a minimum. 

ii) adjustment is made at time t = T when fuel is 
consumed zeroing the error states e.(t) with the 
exponential control u s (T). 

After adjoining (56) to the system (41), the hamiltonian 
becomes: 

H = Piy 2 - p 2 y x + u(a x p x + p a ) - |u| (57) 

Since T is not specified H(t) = 0. With u(t) constrained as 
before, the control that maximizes H with respect to jD(t) is: 
u* = 1 • sgn (aj.Pi + p a ) |a x p x + p 3 I >1 

(58) 

u* = 0 |a x p x + p 2 | < 1 
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5 . 1 Initial Conditions . 



Taking the time derivative of H 

dH , , . du d I u I 

— = <a ‘ p ‘ + p => Si - dT 



( 59 ) 



it can be seen that =0 if = 0. It may also be argued that 

dt dt 

the change in the hamiltonian with time is zero if 



d u A u 

aiPi + Pa = - 7 — L = — 1 — 1 
du Au 



(60) 



Since u(t) is switching between u = 0 and u= ±1 and vice versa 
this means that the hamiltonian remains constant if the control is 
switched at a^i + p 2 = 1 * sgn (Au) , (See Fig. 10). 




Fig. 10 Switching criteria in p x vs p 2 phase plane 
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By choosing control u*(t) the hamiltonian remains at its maxi- 
mum value i.e. identically zero from time t = 0 + after initial control 
has been applied until time t = T. This control minimizes the 
integral (56) but does not necessarily minimize total fuel when fuel 
consumed at switchings is added. In order to minimize switchings, 
it appears necessary to choose the degenerate case i.e. u = 0 
until such time as a^ + p 2 = 1 • sgn (Ziu) where Au is the change 
in u(t) when turning the control on. Notice that this choice 
guarantees that H(t) = 0 for all t, 0 ^ t ^ T. With this in mind, 
the problem remains to minimize fuel in the non-degenerate case. 

For this purpose it will be considered that time t = 0 is that time 
when 

aiPi + Ps = 1 ‘ sgn (Au) (61) 

and initial control is applied. 

At t = 0 it may be verified from (61) and because H (0) = 0 

that 

P°y§ " Ps *i = 0 (62) 

5 . 2 Final boundary conditions . 

In order to investigate final value boundary conditions , the 
optimum trajectories terminating such that y].(T-At)>0 are consi- 
dered. Trajectories in the rest of the space are mirror images. As 
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in the minimum time problem, an optimum trajectory terminating at 
y a (T) = -ai,yi(T) = 0 is investigated first. The determination that 
Pa(T) ^ 0 as argued in (54) is still valid. This condition on p a (T) 
along with the fact that H(T) = 0 precludes the possibility of a 
trajectory terminating as above with u(T) = -1. The following cases 
however, do apply. Consider 

H(T) = -a!Pi(T) + u(T) [aiPi(T) + p a (T)] - |u(T) | = 0 (63) 

This condition implies that if u(T) = 0 then Pi(T) = 0 and if 
u(T) =+l then Pi(T) ^ 0 and p 2 (T) =+l. Fig. 11 portrays the locus 
of admissible points jd(T) and the switching curves generated by 
these criteria in the yivsy 2 phase plane are as in Fig. 11a. 




, Fig. 11 Admissible points jd(T) for y 2 (T) = -a 1 



27 



Yz 
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Fig. 11a Switching criteria for y 2 (T) = -a x (minimum fuel) 
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Optimum trajectories terminating on the line segment yi(T) = 
0, | y 2 (T) |< ax must be investigated in a fashion similar to that 
used with the minimum time problem. Since a final boundary point 



y 2 (T) is not fixed, we may substitute a final condition on p 2 (T) 
to reach a solution. At this point it becomes necessary to decide 
on the final value functional to be minimized. It is first noted 
that if the final control to the line segment is u(T) = 0, then 
-ai < y 2 (T) <0. (It must be remembered that investigation is of 
trajectories such that yi(T-At)> 0). If u(T) = 0 then also y 2 (T) = 
e 2 (T) and in order to minimize the fuel consumed by u 2 (T) to zero 
e 2 after time T then |e 2 (T) | = |ys(T)| must be minimized. 

If the final control is u(T) = -1 (u(T) = +1 is not possible for 

trajectories terminating on this side of the line segment) then 
e 2 (T) = y 2 (T) - ai and, therefore, |y 2 (T) - a x | must be minimized. 

In both of the above cases, it may be seen that y 2 (T) must be 
maximized on the line segment in order that fuel consumed by 
u 2 (T) to zero the error states be minimized. Therefore, the functional 
to be minimized is 



By prior arguments P 2 (T) = -c 2 = +1 and p 3 (T) = p 3 (t) = -c 3 = -1. 



3 

S = £ c.y.(T) = -y 2 (T) + y 3 (T) 



(64) 



where 




(65) 
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5. 3 Generating the switching curve segments . 



It is now helpful to look at the hamiltonian under each of the 
above conditions, i.e. , u(T) = 0 and u(T) = -1. In the first case 
u(T) = 0 

H(T) = H ( t) = 0 

yi(T) =0 (66) 

Ps(T) = +1 
-ai<y 2 (T) <0 

and 



H(T) = Pi(T)y a (T) = 0 

which implies that Pi(T) = 0. Fig. 12 shows the switching gener- 
ated by this condition. 
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Fig. 12 Switching criteria for u(T) = 0 (minimum fuel) 
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Next is considered the case where 



u(T) = -1 

H(T) = H(t) = 0 
Vi(T) = 0 
Ps(T) = +1 



( 67 ) 



and 



H(T) = Px(T)y 2 (T) - l[a lPl (T) + l]- 1 = 0 



from which 



Pi(T) = 



2 

ys(T) - a x 



( 68 ) 



Since u(T) = -1 and p 2 (T) = +1, conditions (58) are met only when 
-2 

Pi(T) < which implies y 2 (T) >0. In Fig. 13 these trajectories 

3x 

and switching curves are plotted. 
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5 . 4 The complete switching curve . 



Because T was never specified and because the fuel consumed 
at switching was handled as a side condition, a composite of all the 
calculated switching curves indicates areas in the phase plane where 
criteria for optimum control appear contradictory. In these areas, 
analysis by graphical means or actual computation will clear up the 
situation. Fig. 14 depicts the composite of the first two criteria 
analized. 




Fig. 13a Switching criteria where u(T) = -1 
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In Fig. 14, region A is an area where there is a question con- 
cerning whether it is optimum to switch for |y 2 (T) | = a x -or 
| y 2 (T) | < ax. By graphical analysis, it may be seen that it is opti- 
mum to switch so that |y 2 (T) | = ax. 

A similar contradiction between trajectories switching for 
0 < y 2 (T) < ai and -a!<y 2 (T) < 0 may also be resolved graphically.* 
The final result consisting of switching criteria to zero the errors 
in the system (20) with minimum fuel is given by Fig. 15. 




Fig. 14 Region of conflicting optimum criteria 



*Appendix I presents computational analysis of the resolving 
process . 
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Fig. 15 Switching criteria for minimum fuel, a x = 1 . 0 
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6. Realization of control. 



With L(y) providing the switching logic everywhere except 
where Yi(T) = 0 and |y 3 (T)|<ax and with u 2 available for final 
action, the controller can now be built (theoretically). 




Fig. 16 Block diagram of the controlled plant 
By subtracting ai6 from e 2 (t), the variable y 2 (t) is gener- 
ated for use in control logic (Fig. 15). At time t = T, Sw 2 
designates u 2 as the control and errors are instantaneously zeroed. 
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The complexity of the logic necessary to implement the switch- 
ing criteria may lead one to desire a simpler, quasi-optimum control. 
Two such controls are pictured in Fig. 17 for minimum time problem. 
Although detailed investigation of these controls was not carried 
out, it is submitted that both controls are close to optimum especially 
for large initial disturbances in the error space. Both controls were 
designed with the thought that chatter motion would not be tolerated, 
switching function would be linear over a large range, and exponen- 
tial control was available at end point of trajectory. 

It is further suggested that the system might be controlled on 
the error states instead of the transformed variable if constraints are 
put on time intervals between switchings so that chatter motion is 
avoided . 
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Fig. 17 Quasi-optimum control surfaces (minimum fuel) 



7 . Conclusions . 



The methods used in this paper to arrive at a solution may be 

th. 

used to good advantage in the investigation of any n order 
system with no more than n-1 zeros. The maximum priciple pro- 
vides a powerful tool in optimization, particularly for linear systems. 
Often the method of Pontryagin will indicate areas of interest to 
investigate when searching for an optimum control even if the unique 
solution is not readily forthcoming. 

The problem of controlling a plant with zeros is analagous to 
controlling a plant without zeros using an impulse- step type 
controller. Results obtained in this paper can be adapted to formulate 
the logic of this type control. 

The realization of the true optimum switching logic in a practical 
system may in many cases not be worth the effort. Quasi-optimum 
control using simple switching functions that are for the most pfart 
linear is a subject for further investigation. Settling time far the sys- 
tem is relatively insensitive to limited variations from the optimum 
when trajectories are out beyond the first cusp of the switching 
curve. The two quasi-optimum controls suggested in this paper 
concentrate on avoiding chatter motion. It may be that a control 
using "controlled chatter motion" /2/ would be acceptable in parti- 
cular systems. This type would be particularly attractive for control 
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of trajectories in the area of the first cusp when linear control 



functions are desired. 
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Appendix I 



Graphical analysis of optimum criteria interface 
Given y 0 in region B of Fig. 16 the problem is to find which 

of the two possible switching criteria is optimum, i.e. , whether 
for trajectories such that yi(T - At)>0 it is optimum to switch for 
0 < y2(T) < ai or -a 1 <y 2 (T)<0. 




Fig. 16 Region of conflicting switching criteria 
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Consider x° on curve 1 of Fig. 16 a distance R from the 
origin. Does it take less fuel to apply u{0) = -1 and zero the 
corresponding values of e. or should the degenerate case be 
chosen until such time as curve 2 is reached (where y{ = 0 and 
y 2 = R) and then switch? The fuel consumed when switching from 
curve 2 is 



ai 


when control is turned on 




T 1 - 7/2 


7T/2 


for I | u | dt 

J o 


ai 


when control turned off at y 2 


1 y 2 (t) | 


to zero corresponding e 2 (T) 



which when totalled is equal to R + tt/2. 

The fuel consumed when switching from curve 1 is 

ai when control is turned on 
r T 

T for |u | dt 
J o 

l^i “ ys(T) | to zero corresponding e 2 (T) 

The time T which is the time for the ^(t) to progress from x° 
at curve 1 to the line segment may be portrayed by use of £_(t) 
as in Fig . 17 . 
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Fig. 17 Time T as a function of and 0 2 in £ plane 



Here it is remembered that 



Pi(T) 

Pa(T) 



2 

Y s (T) - ai 
= +1 



aiPi + Pi = -1 

T = 01 + 02 + 7t/2 



(69) 



and that 



which implies 



H(0) = p?y£ - P 2 y° =0 



Pi = Yi 



(7 0) 



Ps Y2 

The total fuel from 1 then is ax + 0i + 0 2 + it / 2 + |a x - ya(T) | and 
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V 














the angles 0x and 0 2 may be related to points in the yi vs y 2 
space. 



0x = tan 



tan 



-i 



yi 



-i _ _Pi_ = 
Ps 



. Ps(T) ax - y 2 (T) 

* - tan ■ -fTro - 1311 — 2 — 



( 71 ) 



The point at which equal fuel is consumed when switching on curve 
1 or curve 2 is that point where 

R* = 2a x - y 2 (T) + 0x + 0 2 (72) 

For R > R* optimum switching is on curve 2 . 

Computation of an example in the case where a x = 1.0 revealed s 



Yi° = 


-1 


.355 


II 

ow 


2. 


140 


Yi(T) = 


0. 


0 


y 3 (T) = 


0. 


347 


R* 


2. 


533 


Fuel = 


4. 


104 
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